


5 O 

T3 




3 



t 





o 
13 



o 



•a » 





•fl 


o 






o 
o 




3 





Figure 2 



Gene Sequence of sddl7, an Omega-3 Fatty Acid Desaturase Gene 
from Saprolegnia diclina (ATCC 56851) 



1 


ATGACTGAGG 


ATAAGACGAA 


GGTCGAGTTC 


CCGACGCTCA 


CGGAGCTCAA 


51 


GCACTCGATC 


CCGAACGCGT 


GCTTTGAGTC 


GAACCTCGGC 


CTCTCGCTCT 


101 


ACTACACGGC 


CCGCGCGATC 


TTCAACGCGT 


CGGCCTCGGC 


GGCGCTGCTC 


151 


TACGCGGCGC 


GCTCGACGCC 


GTTCATTGCC 


GATAACGTTC 


TGCTCCACGC 


201 


GCTCGTTTGC 


GCCACCTACA 


TCTACGTGCA 


GGGCGTCATC 


TTCTGGGGCT 


251 


TCTTCACGGT 


CGGCCACGAC 


TGCGGCCACT 


CGGCCTTCTC 


GCGCTACCAC 


301 


AGCGTCAACT 


TTATCATCGG 


CTGCATCATG 


CACTCTGCGA 


TTTTGACGCC 


351 


GTTCGAGAGC 


TGGCGCGTGA 


CGCACCGCCA 


CCACCACAAG 


AACACGGGCA 


401 


ACATTGATAA 


GGACGAGATC 


TTTTACCCGC 


ACCGGTCGGT 


CAAGGACCTC 


451 


CAGGACGTGC 


GCCAATGGGT 


CTACACGCTC 


GGCGGTGCGT 


GGTTTGTCTA 


501 


CTTGAAGGTC 


GGGTATGCCC 


CGCGCACGAT 


GAGCCACTTT 


GACCCGTGGG 


551 


ACCCGCTCCT 


CCTTCGCCGC 


GCGTCGGCCG 


TCATCGTGTC 


GCTCGGCGTC 


601 


TGGGCCGCCT 


TCTTCGCCGC 


GTACGCGTAC 


CTCACATACT 


CGCTCGGCTT 


651 


TGCCGTCATG 


GGCCTCTACT 


ACTATGCGCC 


GCTCTTTGTC 


TTTGCTTCGT 


701 


TCCTCGTCAT 


TACGACCTTC 


TTGCACCACA 


ACGACGAAGC 


GACGCCGTGG 


751 


TACGGCGACT 


CGGAGTGGAC 


GTACGTCAAG 


GGCAACCTCT 


CGAGCGTCGA 


801 


CCGCTCGTAC 


GGCGCGTTCG 


TGGACAACCT 


GAGCCACCAC 


ATTGGCACGC 


851 


ACCAGGTCCA 


CCACTTGTTC 


CCGATCATTC 


CGCACTACAA 


GCTCAACGAA 


901 


GCCACCAAGC 


ACTTTGCGGC 


CGCGTACCCG 


CACCTCGTGC 


GCAGGAACGA 


951 


CGAGCCCATC 


ATCACGGCCT 


TCTTCAAGAC 


CGCGCACCTC 


TTTGTCAACT 


1001 


ACGGCGCTGT 


GCCCGAGACG 


GCGCAGATCT 


TCACGCTCAA 


AGAGTCGGCC 


1051 


GCGGCCGCCA AGGCCAAGTC 


GGACTAA 







Figure 3 



Amino Acid Sequence of an Omega-3 Fatty Acid Desaturase (SDD17) from Saprolegma 

diclina (ATCC 56851) 



1 MTEDKTKVEF PTLTELKHSI PNACFESNLG LSLYYTARAI FNASASAALL 

51 YAARSTPFIA DNVLLHALVC ATYIYVQGVI FWGFFTVGHD CGHSAFSRYH 

101 SVNFIIGCIM HSAILTPFES WRVTHRHHHK NTGNIDKDEI FYPHRSVKDL 

151 QDVRQWVYTL GGAWFVYLKV GYAPRTMSHF DPWDPLLLRR ASAVIVSLGV 

201 WAAFFAAYAY LTYSLGFAVM GLYYYAPLFV FASFLVI TTF LHHNDEATPW 

251 YGDSEWTYVK GNLSSVDRSY GAFVDNLS HH IGTHQVHHLF PIIPHYKLNE 

301 ATKHFAAAYP HLVRRNDEP I I TAFFKT AHL FVNYGAVPET AQIFTLKESA 

351 AAAKAKSD* 



Figure 4 



Comparative analysis of S. diclina Delta 17-desaturase (SDD17.pep) 

& Synechocystis sp. Delta 15-desaturase (SYCDESB) 



Frarre: 2 initn: 733 initl: 305 opt: 689 Z-score: 996.8 expect(): 1.5e-47 
40.9% identity in 269 aa overlap 
(76-336:204-471) 
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Figure 5 



Comparative analysis of S. diclina Delta 17-desaturase (SDD17.pep) 
C. elegans Delta 17-desaturase (CELEFAT) 



Frame: 1 initn: 490 initl: 222 opt: 502 Z-score: 724.0 expect(): 2.3e-32 
31.6% identity in 310 aa overlap 
(2-303:49-347) 
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Gene Sequence of sddl2, a Delta 12-Desaturase Gene 
from Saprolegnia diclina (ATCC 56851) 

1 ATGTGCAAAG GTCAAGCTCC TTCCAAGGCC GACGTGTTCC ACGCTGCGGG 

51 GTACCGCCCG GTCGCCGGCA CGCCCGAGCC GCTGCCGCTG GAGCCCCCGA 

101 CGATCACGCT CAAGGACCTG CGCGCGGCGA TCCCGGCCCA CTGCTTTGAG 

151 CGCAGCGCTG CCACTAGCTT TTACCATTTG GCCAAGAACC TTGCGATCTG 

201 CGCCGGCGTG TTCGCCGTTG GCCTCAAGCT CGCGGCTGCC GACTTGCCGC 

251 TCGCGGCCAA GCTGGTCGCG TGGCCCATCT ACTGGTTCGT CCAGGGCACG 

301 TACTTTACGG GCATCTGGGT CATTGCGCAC GAATGCGGCC ACCAGGCGTT 

351 CTCGGCGTCC GAGATCCTCA ACGACACGGT CGGTATCATT CTTCACTCGC 

401 TCCTCTTTGT GCCGTACCAC AGCTGGAAGA TCACGCACCG CCGCCACCAC 

451 TCCAACACGG GCAGCTGCGA GAACGACGAG GTGTTTACGC CGACGCCGCG 

501 GTCCGTCGTC GAGGCCAAGC ACGACCACTC GCTCCTCGAA GAGAGCCCGC 

551 TCTACAACCT GTACGGCATC GTCATGATGC TTCTCGTGGG CTGGATGCCG 

601 GGCTACCTCT TCTTCAACGC GACCGGCCCG ACCAAGTACG CTGGCCTCGC 

651 CAAGTCGCAC TTCAACCCGT ACGCAGCCTT TTTCCTCCCA AAGGAGCGCC 

701 TCAGCATCTG GTGGAGCGAC CTCTGCTTCC TCGCGGCCTT GTACGGCTTT 

751 GGCTACGGCG TCTCGGTCTT CGGCCTCCTC GATGTCGCCC GCCACTACAT 

801 CGTGCCGTAC CTCATTTGCA ACGCGTACCT CGTGCTCATC ACGTACCTCC 

851 AGCACACGGA TACGTACGTG CCCCACTTCC GCGGCGACGA GTGGAACTGG 

901 CTGCGCGGCG CGCTCTGCAC CGTCGACCGC TCGTTCGGCG CGTGGATCGA 

951 CAGCGCGATC CACCACATTG CCGACACGCA CGTGACGCAC CACATTTTCT 

1001 CCAAGACGCC CTTCTACCAC GCGATCGAGG CGACCGACGC CATCACGCCC 

1051 CTCCTCGGCA AGTACTACCT CATCGACCCG ACGCCGATCC CGCTGGCGCT 

1101 CTGGCGCTCG TTCACGCACT GCAAGTACGT CGAGGACGAC GGCAACGTTG 

1151 TGTTTTACAA GCGCAAGCTC GAGGAAAAGT AA 



Figure 7 

Amino Acid Sequence of a Delta 12-Desaturase (SDD12) from Saprolegnia diclina (ATCC 

56851) 



1 MCKGQAPSKA DVFHAAGYRP VAGTPEPLPL EPPTITLKDL RAAIPAHCFE 

51 RSAATSFYHL AKNLAICAGV FAVGLKLAAA DLPLAAKLVA WPIYWFVQGT 

101 YFTGIWVIAH ECGHQAFSAS EILNDTVGII LHSLLFVPYH SWKITHRRHH 

151 SNTGSCENDE VFTPTPRSW EAKHDHSLLE ESPLYNLYGI VMMLLVGWMP 

201 GYLFFNATGP TKYAGLAKSH FNPYAAFFLP KERLSIWWSD LCFLAALYGF 

251 GYGVSVFGLL DVARHYIVPY LICNAYLVLI TYLQHTDTYV PHFRGDEWNW 

301 LRGALCTVDR SFGAWIDSAI HHIADTHVTH HIFSKTPFYH AIEATDAITP 

351 LLGKYYLIDP TPIPLALWRS FTHCKYVEDD GNWFYKRKL EEK* 



Figure 8 



Comparative analysis of S. diclina Delta 12-desaturase (SDD12.pep) 
& G. hirsutum Delta 12-desaturase (GH06DES) 

Fraue: 3 initn: 992 initl: 413 opt: 1086 Z-score: 1531.8 expect(): 2.3e-77 

45 , 6% identity in 379 aa overlap 
(9-380:14-384) 
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Sequence ID: 

Sequence ID 1 

5'-ATC CGC GCC GCC ATC CCC AAG CAC TGC TGG GTC AAG-3 ' 
Sequence ID 2 

5'- GCC CTC TTC GTC CTC GGC CAY GAC TGC GGC CAY GGC TCG TTC Tec- 
s' 

Sequence ID 3 

5' -GAG RTG GTA RTG GGG GAT CTG GGG GAA GAR RTG RTG GRY GAC RTG - 3 ' 
Sequence ID 4 

5' -CCC TAC CAY GGC TGG CGC ATC TCG CAY CGC ACC CAY CAY CAG AAC-3' 
Sequence ID 5 

5'-GTT CTG RTG RTG GGT CCG RTG CGA GAT GCG CCA GCC RTG GTA GGG- 3' 
Sequence ID 6 

5'- GGC TCG CAC TTC SAC CCC KAC TCG GAC CTC TTC GTC- 3' 
Sequence ID 7 

5'- GAC GAA GAG GTC CGA GTM GGG GTW GAA GTG CGA GCC- 3' 
Sequence ID 8 

5'- GCG CTG GAK GGT GGT GAG GCC GCC GCG GAW GSA CGA CCA- 3' 
Sequence ID 9 

5' - CTG GGG GAA GAG RTG RTG GAT GAC RTG GGT GCC GAT GTC RTG RTG- 
3' 



Sequence ID 10 



5'- GGT GGC CTC GAY GAG RTG GTA RTG GGG GAT CTK GGG GAA GAR RTG- 
3' 



Sequence ID 11 

5' -GAG RTG GTA RTG GGG GAT CTG GGG GAA GAR RTG RTG GRY GAC RTG -3 
Sequence ID 12 

5'-TAC GCG TAG CTC ACG TAC TCG CTC G-3' 
Sequence ID 13 

5'-TTC TTG CAC CAC AAC GAC GAA GCG ACG- 3' 
Sequence ID 14 

5 ' -GGA GTG GAC GTA CGT CAA GGG CAA C-3' 
Sequence ID 15 

5'-TCA AGG GCA ACC TCT CGA GCG TCG AC -3 ' 
Sequence ID 16 

5'-CCC AGT CAC GAC GTT GTA AAA CGA CGG CCA G-3' 

Sequence ID 17 
5' - AGC GGA TAA CAA TTT CAC ACA GGA AAC AGC -3' 

Sequence ID 18 

5' -GGT AAA AGA TCT CGT CCT TGT CGA TGT TGC-3' 
Sequence ID 19 

5'-GTC AAA GTG GCT CAT CGT GC-3' 



Sequence ID 2 0 

5' -CGA GCG AGT ACG TGA GGT ACG CGT AC- 3' 



Sequence ID 21 

5' -TCA ACA GAA TTC ATG ACC GAG GAT AAG ACG AAG GTC GAG TTC CCG-3 



Sequence ID 22 

5' -AAA AGA AAG CTT CGC TTC CTA GTC TTA GTC CGA CTT GGC CTT GGC-3' 
Sequence ID 23 

5' -TCA ACA AAG CTT ATG ACC GAG GAT AAG ACG AAG GTC GAG TTC 
CCG-3' 

Sequence ID 24 

5 ' -AAA AGA GAA TTC CGC TTC CTA GTC TTA GTC CGA CTT GGC CTT 
GGC-3" 

Sequence ID 25 

1 ATGACTGAGG ATAAGACGAA GGTCGAGTTC CCGACGCTCA CGGAGCT CAA 



O -L 




CCGAACGCGT 


GCTTTGAGTC 


GAACCTCGGC 


CTCTCGCTCT 


101 






TT f A A CGCGT 


CGGCCTCGGC 


GGCGCTGCTC 


151 


TACGCGGCGC 


GCTCGACGCC 


GTTCATTGCC 


GAT AACGTT C 


TGCTCCACGC 


201 


GCTCGTTTGC 


GCCACCTACA 


TCTACGTGCA 


GGGCGTCATC 


TTCTGGGGCT 


251 


TCTTCACGGT 


CGGCCACGAC 


TGCGGCCACT 


CGGCCTTCTC 


GCGCTACCAC 


301 


AGCGTCAACT 


TTATCATCGG 


CTGC AT CATG 


CACTCTGCGA 


TTTTGACGCC 


351 


GTTCGAGAGC 


TGGCGCGTGA 


CGCACCGCCA 


CCACCACAAG 


AACACGGGCA 


401 


ACATTGATAA 


GGACGAGATC 


TTTTACCCGC 


ACCGGTCGGT 


CAAGGACCTC 


451 


CAGGACGTGC 


GCCAATGGGT 


CTACACGCTC 


GGCGGTGCGT 


GGTTTGTCTA 


501 


CTTGAAGGTC 


GGGTATGCCC 


CGCGCACGAT 


GAGCCACTTT 


GACCCGTGGG 


551 


ACCCGCTCCT 


CCTTCGCCGC 


GCGTCGGCCG 


TCATCGTGTC 


GCTCGGCGTC 


601 


TGGGCCGCCT 


TCTTCGCCGC 


GTACG CGTAC 


CTCACATACT 


CGCTCGGCTT 


651 


TGCCGTCATG 


GGC CTCTACT 


ACTATGCGCC 


GCTCTTTGTC 


TTTGCTTCGT 


701 


TCCTCGTCAT 


TACGACCTTC 


TTGCACCACA 


ACGACGAAGC 


GACGCCGTGG 


751 


TACGGCGACT 


CGGAGTGGAC 


GTACGTCAAG 


GGCAACCTCT 


CGAGCGTCGA 


801 


CCGCTCGTAC 


GGCGCGTTCG 


TGGACAACCT 


GAGCCACCAC 


ATTGGCACGC 


851 


ACCAGGTCCA 


CCACTTGTTC 


CCGATCATTC 


CGCACTACAA 


GCTCAACGAA 


901 


GCCACCAAGC 


ACTTTGCGGC 


CGCGTACCCG 


CACCTCGTGC 


GCAGGAACGA 


951 


CGAGCCCATC 


ATCACGGCCT 


TCTTCAAGAC 


CGCGCACCTC 


TTTGTCAACT 


1001 


ACGGCGCTGT 


GCCCGAGACG 


GCGCAGATCT 


TCACGCTCAA 


AGAGTCGGCC 


1051 


GCGGCCGCCA 


AGGCCAAGTC 


GGACTAA 







Sequence ID 2 6 

1 MTEDKTKVEF PTLTELKHSI PNACFESNLG LSLYYTARAI FNASASAALL 





51 


YAARSTPFIA DNVLLHALVC ATYIYVQGVI 


FWGFFTVGHD 


CGHSAFSRYH 




101 


SVNFIIGCIM 


HSAILTPFES 


WRVTHRHHHK 


TVTf-f-1 /~i -r -T- T~*N T7"T\T1 T 

NTGNIDKDEI 


FYPHRSVKDL 




1 R1 
j — j j- 


rvm tt> OTxn tvtt 
yjj VKyW v ilL 






D PWD PLLLRR 


AS AV I VS LGV 

nun v J_ v lj uv? v 




201 


WAAFFAAYAY 


LTYSLGFAVM 


GLYYYAPLFV 


FASFLVITTF 


LHHNDEATPW 




251 


YGDSEWTYVK 


GNLSSVDRSY 


GAFVDNLSHH 


IGTHQVHHLF 


P I I PHYKLNE 




j> U 1 


ATKHFAAAYP 


HLVRRNDEPI 


ITAFFKTAHL 


FVNYGAVPET 


AQIFTLKESA 




351 


AAAKAKSD* 










u 


Sequence 


ID 27 












1 


ATGGCCCCGC 


AGACGGAGCT 


CCGCCAGCGC 


CACGCCGCCG 


TCGCCGAGAC 


CSS? 


51 


GCCGGTGGCC GGCAAGAAGG CCTTTACATG GCAGGAGGTC GCGCAGCACA 




101 


ACACGGCGGC 


CTCGGCCTGG 


ATCATTATCC 


GCGGCAAGGT 


CTACGACGTG 


"SI 


151 


AC CG AGTGGG 


CCAACAAGCA 


CCCCGGCGGC 


CGCGAGATGG 


TGCTGCTGCA 


h s 


201 


CGCCGGTCGC 


GAGGCCACCG 


ACACGTTCGA 


CTCGTACCAC 


CCGTTCAGCG 




251 


ACAAGGCCGA 


GTCGATCTTG 


AACAAGTATG 


AGATTGGCAC 


GTTCACGGGC 


f 5 "= 


301 


CCGTCCGAGT 


TTCCGACCTT 


CAAGCCGGAC 


ACGGGCTTCT 


ACAAGGAGTG 


ft 


351 


CCGCAAGCGC 


GTTGGCGAGT 


ACTTCAAGAA 


GAACAACCTC 


CATCCGCAGG 


E5 


401 


ACGGCTTCCC 


GGGCCTCTGG 


CGCATGATGG 


TCGTGTTTGC 


GGTCGCCGGC 




451 


CTCGCCTTGT 


ACGGCATGCA 


CTTTT CGACT 


ATCTTTGCGC 


TGCAGCTCGC 


ry 


501 


GGCCGCGGCG 


CTCTTTGGCG 


TCTGCCAGGC 


GCTGCCGCTG 


CTCCACGTCA 




551 


TGCACGACTC 


GTCGCACGCG 


TCGTACACCA 


ACATGCCGTT 


CTTCCATTAC 




601 


GTCGTCGGCC 


GCTTTGCCAT 


GG ACT GGTTT 


GCCGGCGGCT 


CGATGGTGTC 




651 


ATGGCTCAAC 


CAGCACGTCG 


TGGGCCACCA 


CATCTACACG 


AACGTCGCGG 




701 


GCTCGGACCC 


GGATCTTCCG 


GT CAACATGG 


ACGGCGACAT 


CCGCCGCATC 




751 


GTGAACCGCC 


AGGTGTTCCA 


GCCCATGTAC 


GCATTCCAGC 


ACATCTACCT 




801 


TCCGCCGCTC 


TATGGCGTGC 


TTGGCCTCAA 


GTTCCGCATC 


CAGGACTTCA 




851 


CCGACACGTT 


CGGCTCGCAC 


ACGAACGGCC 


CGATCCGCGT 


CAACCCGCAC 




901 


GCGCTCTCGA 


CGTGGATGGC 


CATGAT CAGC 


TCCAAGTCGT 


TCTGGGCCTT 




951 


CTACCGCGTG 


TACCTTCCGC 


TTGCCGTGCT 


CCAGATGCCC 


ATCAAGACGT 




1001 


ACCTTGCGAT 


CTTCTTCCTC 


GCCGAGTTTG 


TCACGGGCTG 


GTACCTCGCG 




1051 


TTCAACTTCC 


AAGTAAGCCA 


TGTCTCGACC 


GAGTGCGGCT 


ACCCATGCGG 




1101 


CGACGAGGCC 


AAGATGGCGC 


T C CAGGACGA 


GTGGGCAGTC 


TCGCAGGTCA 



1151 


AGACGTCGGT 


CGACTACGCC 


CATGGCTCGT 


GGATGACGAC 


GTTCCTTGCC 


1201 


GGCGCGCTCA 


ACTACCAGGT 


CGTGCACCAC 


TTGTTCCCCA 


GCGTGTCGCA 


1251 


GTACCACTAC 


CCGGCGATCG 


CGCCCATCAT 


CGTCGACGTC 


TGCAAGGAGT 


1301 


AC AAC AT C AA 


GTACGCCATC 


TTGCCGGACT 


TTACGGCGGC 


GTTCGTTGCC 


1351 


CACTTGAAGC 


ACCTCCGCAA 


CATGGGCCAG 


CAGGGCATCG 


CCGCCACGAT 


1401 


CCACATGGGC 


TAA 









Sequence ID 28 



1 


ATGGCAAACA 


GCAGCGTGTG 


GGATGATGTG 


GTGGGCCGCG 


TGGAGACCGG 


51 


CGTGGACCAG 


TGGATGGATG 


GCGCCAAGCC 


GTACGCACTC 


ACCGATGGGC 


101 


TCCCGATGAT 


GGACGTGTCC 


ACCATGCTGG 


CATTCGAGGT 


GGGATACATG 


151 


GCCATGCTGC 


TCTTCGGCAT 


CCCGATCATG 


AAGCAGATGG 


AGAAGCCTTT 


201 


TGAGCTCAAG 


ACCATCAAGC 


TCTTGCACAA 


CTTGTTTCTC 


TTCGGACTTT 


251 


CCTTGTACAT 


GTGCGTGGAG 


ACCATCCGCC 


AGGCTATCCT 


CGGAGG CT AC 


301 


AAAGTGTTTG 


GAAACGACAT 


GGAGAAGGGC 


AACGAGTCTC 


ATGCTCAGGG 


351 


CATGTCTCGC 


ATCGTGTACG 


TGTTCTGCGT 


GTCCAAGGCA 


T ACGAGTT CT 


401 


TGGAT AC CGC 


CATCATGATC 


CTTTGCAAGA 


AGTTCAACCA 


GGTTTCCTTC 


451 


TTGCATGTGT 


ACCACCATGC 


CACCATTTTT 


GCCATCTGGT 


GGGCTATCGC 


501 


CAAGTACGCT 


CCAGGAGGTG 


ATGCGTACTT 


TTCAGTGATC 


CTCAACTCTT 


551 


TCGTGCACAC 


CGT CATGT AC 


GCATACTACT 


TCTTCTCCTC 


CCAAGGGTTC 


601 


GGGTTCGTGA 


AGCCAATCAA 


GCCGTACATC 


ACCACCCTTC 


AGATGACCCA 


651 


GTTCATGGCA 


ATGCTTGTGC 


AGTCCTTGTA 


CGACTACCTC 


TTCCCATGCG 


701 


ACTACCCACA 


GGCTCTTGTG 


CAGCTTCTTG 


GAGTGTACAT 


GATCACCTTG 


751 


CTTGCCCTCT 


TCGGCAACTT 


TTTTGTGCAG 


AGCT AT CTT A 


AAAAGCCAAA 


801 


AAAGAGCAAG 


ACCAACTAA 









Sequence ID 29 

1 MTVGFDETVT MDTVRNHNMP DDAWCAI HGT VYDITKFSKV HPGGDIIMLA 

51 AGKEATI LFE TYHIKGVPDA VLRKYKVGKL PQGKKGETSH MPTGLDSASY 
101 YSWDSEFYRV LRERVAKKLA EPGLMQRARM ELWAKAIFLL AGFWGSLYAM 
151 CVLDPHGGAM VAAVTLGVFA AFVGTCIQHD GSHGAFSKSR FMNKAAGWTL 
201 DM I GAS AMT W EMQHVLGHHP YTNLIEMENG LAKVKGADVD PKKVDQESDP 
251 DVFSTYPMLR LHPWHRQRFY HKFQHLYAPL IFGFMTINKV ISQDVGWLR 
301 KRLFQIDANC RYGSPWNVAR FWTMKLLTTL YMVALPMYMQ GPAQGLKLFF 



351 MAHFTCGEVL ATMFIVNHII 

401 QKALSAAEST KSDADKTTMI 

451 NHQIEHHCFP QNPHTVNVYI 

501 HLRTLGNEDL TAWST* 

Sequence ID 3 0 

5'-CCG SAG TTC ACS ATC AAG 
CAC TGC TTC -3' 

Sequence ID 31 

5' -GRS CTT CTT GAK GTG GWM 
CGG CAT- 3' 



EGVSYASKDA VKGVMAPPRT VHGVTPMQVT 
PLNDWAAVQC QTSWWAVGS WFWNHFSGGL 
SGIVKETCEE YGVPYQAEIS LFSAYFKMLS 



GAG ATC CGC GAS KSC ATC CCG GCC 
SGT GGC CTC CTC GGC GTG GTA GWR 



Sequence ID 32 

5 ' -CCS STC TAC TGG GCC TGC CAG GGT RTC GTC CTC ACS GGT GTC 
TGG-3 ' 

Sequence ID 33 

5' -CCS STC TAC TGG ATC RYS CAG GGT RTC GTC KGY ACS GGT GTC 
TGG-3 ' 

Sequence ID 34 

5' -GGC GTG GTA GTG CGG CAT SMM CGA GAA GAR GTG GTG GGC GAC 
GTG- 3' 

Sequence ID 35 

5' -CAC GTA CCT CCA GCA CAC GGA CAC CTA CG-3' 

Sequence ID 3 6 

5'- GAT CGA CAG CGC GAT CCA CCA CAT TGC- 3' 



Sequence ID 37 

5' - CAA ATG GTA AAA GCT AGT GGC AGC GCT GC-3' 



Sequence ID 38 

5'-AGT ACG TGC CCT GGA CGA ACC AGT AGA TG-3' 



Sequence ID 3 9 

5 ' - TCA ACA GAA TTC ATG TGC AAA GGT CAA GCT CCT TCC AAG GCC 
GAC GTG -3' 

Sequence ID 4 0 

5 ' - AAA AGA AAG CTT TTA CTT TTC CTC GAG CTT GCG CTT GTA AAA 
CAC AAC-3 ' 

Sequence ID 41 

1 ATGTGCAAAG GTCAAGCTCC TTCCAAGGCC GACGTGTTCC ACGCTGCGGG 



O X 


G 1 AG GGG C C G 


G 1 CGCGGGGA 


CGCCCGAGCG 


GCTGCCGCTG 


GAGCCCCCGA 


-LU -L 


GGA1 CAGGC 1 


CAAGGAGG i G 


CGCGCGGGGA 


1 GGGGGGCCA 


CTGCTTTGAG 


151 


CGCAGCGCTG 


CCACTAGCTT 


TTACCATTTG 


GCCAAGAACC 


TTGCGATCTG 


201 


CGCCGGCGTG 


TTCGCCGTTG 


GCCTCAAGCT 


CGCGGCTGCC 


GACTTGCCGC 


251 


TCGCGGCCAA 


GCTGGTCGCG 


TGGCCCATCT 


ACTGGTTCGT 


CCAGGGCACG 


301 


TACTTTACGG 


GCATCTGGGT 


CATTGCGCAC 


GAATGCGGCC 


ACCAGGCGTT 


351 


CTCGGCGTCC 


GAGATCCTCA 


ACGACACGGT 


CGGTAT CATT 


CTTCACTCGC 


401 


TCCTCTTTGT 


GCCGTACCAC 


AGCTGGAAGA 


TCACGCACCG 


CCGCCACCAC 


451 


TCCAACACGG 


GCAGCTGCGA 


GAACGACGAG 


GTGTTTACGC 


CGACGCCGCG 


501 


GTCCGTCGTC 


GAGGCCAAGC 


ACGACCACTC 


GCTCCTCGAA 


GAGAGCCCGC 


551 


TCTACAACCT 


GTACGGCATC 


GTCATGATGC 


TTCTCGTGGG 


CTGGATGCCG 


601 


GGCTACCTCT 


TCTTCAACGC 


GACCGGCCCG 


ACCAAGTACG 


CTGGCCTCGC 


651 


CAAGTCGCAC 


TTCAACCCGT 


ACGCAGCCTT 


TTTCCTCCCA 


AAGGAGCGCC 


701 


TCAGCATCTG 


GTGGAGCGAC 


CTCTGCTTCC 


TCGCGGCCTT 


GTACGGCTTT 


751 


GGCTACGGCG 


TCTCGGTCTT 


CGGCCTCCTC 


GATGTCGCCC 


GCCACTACAT 


801 


CGTGCCGTAC 


CTCATTTGCA 


ACGCGTACCT 


CGTGCTCATC 


ACGTACCTCC 


851 


AGCACACGGA 


TACGTACGTG 


CCCCACTTCC 


GCGGCGACGA 


GTGGAACTGG 


901 


CTGCGCGGCG 


CGCTCTGCAC 


CGTCGACCGC 


TCGTTCGGCG 


CGTGGATCGA 


951 


CAGCGCGATC 


CACCACATTG 


CCGACACGCA 


CGTGACGCAC 


CACATTTTCT 


1001 


CCAAGACGCC 


CTTCTACCAC 


GCGATCGAGG 


CGACCGACGC 


CATCACGCCC 


1051 


CTCCTCGGCA 


AGTACTACCT 


CATCGACCCG 


ACGCCGATCC 


CGCTGGCGCT 


1101 


CTGGCGCTCG 


TTCACGCACT 


GCAAGTACGT 


CGAGGACGAC 


GGCAACGTTG 



1151 TGTTTTACAA GCGCAAGCTC GAGGAAAAGT AA 
Sequence ID 42 

1 MCKGQAPSKA DVFHAAGYRP VAGTPEPLPL EPPTITLKDL RAAIPAHCFE 

51 RSAATSFYHL AKNLAI CAGV FAVGLKLAAA DLPLAAKLVA WPIYWFVQGT 
101 YFTGIWVIAH ECGHQAFSAS EILNDTVGII LHSLLFVPYH SWKITHRRHH 
151 SNTGSCENDE VFTPTPRSW EAKHDHSLLE ESPLYNLYGI VMMLLVGWMP 
201 GYLFFNATGP TKYAGLAKSH FNPYAAFFLP KERLSIWWSD LCFLAALYGF 
251 GYGVSVFGLL DVARHY I VP Y LICNAYLVLI TYLQHTDTYV PHFRGDEWNW 
301 LRGALCTVDR SFGAWIDSAI HHIADTHVTH HIFSKTPFYH AIEATDAITP 
351 LLGKYYL I DP TPIPLALWRS FTHCKYVEDD GNWFYKRKL EEK* 



